The use of punched card tabulating equipment is not new: It
was used a number of years ago in the Bureau of the Census for the
preparation of correlation tables. More recently it has been used
in at least two of the Bureaus of the Department of Agriculture in
a similar way. The writer in no sense claims to be the sole author
of the methods herein set forth except insofar as the special
technique is devised to suit the equipment and problems of the Bureau
of Agricultural Economics.

Especial credit is due to Messrs. Tolley and Ezekiel of this
Bureau for devising the least square method (page 15 et seq.) of
approaching correlation problems. The writer also wishes to express
his appreciation to them for going over the following text, and for
their suggestions and helpful criticism.

The coefficient of Multiple Correlation, R, is a measure of
the degree of agreement between a given series and the estimated
(generally forecasted) values of the same series, when these
estimated values are determined mathematically from two or more
different (independent) series.* The following pages describe the forms
and arithmetic used in computing the value of R and in deriving the
"regression equation" or forecasting formula. A five variable (four
independent) example is given.




*The coefficient of Multiple Correlation (R) may be defined as the
coefficient of correlation (Pearsonian) between the dependent
variable and corresponding estimates thereof, computed from the
independent variables; i.e., if X', the estimated value of X, equals
b1A plus b2B plus b3C plus a constant, where A, B, and C are
the independent variables and b1, b2, b3 are the net regression
coefficients, R equals the mean product of the deviations of X and
X' from their respective averages divided by the product of the
standard deviations of X and X'. R is also equal to the standard
deviation of X' divided by the standard deviation of X. Also, the
standard deviation of X' squared is equal to the mean product of
the deviations of X and X' from their respective averages.
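The three statements in this footnote can be checked numerically. The sketch below is only an illustration: the five observations are invented, a single independent series is used instead of three to keep the arithmetic short, and the estimates X' are assumed to come from a least-squares fit (the identities depend on that).

```python
from math import sqrt

# Hypothetical illustrative data: one independent series A, dependent X.
A = [1, 2, 3, 4, 5]
X = [2, 1, 4, 3, 5]

def mean(values):
    return sum(values) / len(values)

def mean_product(u, v):
    """Mean product of the deviations of u and v from their averages."""
    mu, mv = mean(u), mean(v)
    return sum((a - mu) * (b - mv) for a, b in zip(u, v)) / len(u)

# Least-squares line X' = const + b*A (a single independent variable).
b = mean_product(A, X) / mean_product(A, A)
const = mean(X) - b * mean(A)
X_est = [const + b * a for a in A]

sd_X = sqrt(mean_product(X, X))
sd_est = sqrt(mean_product(X_est, X_est))

# R as the Pearsonian correlation between X and X'.
R_by_definition = mean_product(X, X_est) / (sd_X * sd_est)
# R as the ratio of the two standard deviations.
R_by_ratio = sd_est / sd_X

print(round(R_by_definition, 4), round(R_by_ratio, 4))
```

Both expressions give the same value here, and the squared standard deviation of X' agrees with the mean product of the deviations of X and X'.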
The data should be coded so that the variation in each series
is somewhere between ten and thirty, depending upon the discretion
of the person conducting the investigation. The smaller the
variation, the simpler is the process of performing the calculations, for
the whole success of using Hollerith Machines* lies in the grouping
of like values.** On the other hand, too great a grouping, or
reduction of variation, results in reduced accuracy. The coding may
be accomplished either by subtraction or by division or both: Thus
if the series under consideration varies between 120 and 175,
subtract 120 from all items, and divide the differences by 5, thus
reducing the variation to from 0 to 11. This coding of the series in
no way invalidates the calculation of R. Coding by division slightly
reduces accuracy, but subtracting in no way affects accuracy.
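The coding rule in the example can be sketched as follows. The function name and the sample observations are hypothetical, and dropping the remainder after division (rather than rounding to the nearest class) is an assumption; the text does not say which the coder should do.

```python
def code_value(value, low=120, width=5):
    """Subtract the lower limit, then divide by the class width.

    Assumption: the remainder is dropped, so each class covers
    `width` consecutive original values.
    """
    return (value - low) // width

series = [120, 133, 147, 158, 175]   # hypothetical observations
coded = [code_value(v) for v in series]
print(coded)
```

The subtraction merely shifts every item by the same amount; it is the division, which throws several original values into one class, that costs a little accuracy.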

On the following pages are given the coded series used in the
example. A, B, C, & D are the independent variables. X is the
dependent variable (the values of which it is desired to estimate, or
"predict"). The series should be so arranged that the dependent
variable is placed in a column to the right of the independents. After
the coded series have been listed, cross-add the values for A, B, C,
D, & X for each observation and list the sum in the "Check Sum"
column on the right; Col. S in the tabulation. This check sum column
is not essential to the solution but is of considerable aid in
checking the arithmetic.
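A minimal sketch of the cross-addition, using three invented observations (the values are hypothetical and are not taken from the example series):

```python
# Hypothetical coded observations: (A, B, C, D, X) per card.
observations = [(3, 4, 7, 2, 5), (10, 5, 6, 4, 3), (12, 3, 8, 0, 6)]

# Cross-add each line to form the check sum S, listed as the last column.
listed = [row + (sum(row),) for row in observations]
for row in listed:
    print(row)
```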

* There are several types of punched card sorting and tabulating
machines. Those used in this Bureau are for the most part Hollerith
Machines.

** Moreover, if the extensions are to be made by hand, figures that
can be readily multiplied mentally greatly facilitate the process.
After the data has been coded and listed as on the preceding
page* the values are then punched on punch cards, the record number
also being punched for the purpose of identification. There are
forty-five columns on a punch card. In our example the record number
was punched in columns 31-2-3; A in columns 34-5; B in 36-7; C in 38-9;
D in 40-1; X in 42-3; and S in 44-5. One card is used for each line;
for each observation, that is.

After the cards have been punched, they should be summed:
The sum of A, of B, of C, of D, of X and of S should be obtained; also
the number of cards should be counted. The results should be recorded
in some such form as follows:


                               Form 1

                   Sums and Means of the Variables

       : # Items :    A    :    B    :    C    :    D    :    X    :    S
Sums:  :   171   :  1302   :   705   :  1149   :   680   :   774   :   4610
Means: :         : 7.6140  : 4.1228  : 6.7193  : 3.9766  : 4.5263  : 26.9591

The use of the Check Sum first becomes apparent here: Evidently
the sum of the sums of A, B, C, D, & X should equal the sum of S; which
is the case. The same is true of the means (averages). This checks
the first additions used in building up the check sum itself; it also
checks the accuracy of the punching; and also of the division in securing
the averages. The values filled into the form above are for our example.
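The check just described can be run on the figures of the example. The sums and the number of items below are those stated for Form 1; the variable names are merely illustrative.

```python
n_items = 171
sums = {"A": 1302, "B": 705, "C": 1149, "D": 680, "X": 774, "S": 4610}

# The sums of A, B, C, D, and X must add to the sum of the check column S.
variables_total = sum(sums[k] for k in "ABCDX")

# The same must hold for the means (each sum divided by the number of items).
means = {k: v / n_items for k, v in sums.items()}
means_total = sum(means[k] for k in "ABCDX")

print(variables_total == sums["S"])
print(round(means["A"], 4), round(means["S"], 4))
```

Were a card mispunched in one variable but not in S, the two totals would disagree and the error would be caught before any extensions are made.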

*It is sometimes feasible to do the coding by punching the original values
upon the punch cards. Then sort the cards on the variable to be coded;
group the arrayed values into the determined upon classes and gang punch
each group in a new column with the assigned class value, such as 0, 5, &c.
The check sum for the individual record then can be prepared by showing
each card separately in the tabulator, adding across, and subsequently
punching upon the card, after which the procedure is as given above.




The next step is to sort the cards upon the first variable, A;
the form for recording the tabulation is given below:

                                   Form 2.

Sorted on: ____

Sorted   :  No.  :      A      :      B      :      C      :      D      :      X      :      S
Variable :  of   :             :             :             :             :             :
Class    : Items : Sum  : Ext. : Sum  : Ext. : Sum  : Ext. : Sum  : Ext. : Sum  : Ext. : Sum  : Ext.
Value    :       :      :      :      :      :      :      :      :      :      :      :      :
  (1)    :  (2)  : (5)  : (6)  : (7)  : (8)  : (9)  : (10) : (11) : (12) : (13) : (14) : (15) : (16)

Sums     :
The cards being sorted on the first variable, A, group them
into packs: all the cards of the lowest value of A in the first pack,
of the next value of A in the second pack, &c. &c. List in column
(1), Form 2, the value of A in the first pack. Tabulate this pack.
On the first line in column (2) write the number of cards in the
pack; in column (5) write the sum of the values of A in this first
pack; in column (7) write the sum of the values of B in this first
pack; in column (9) the sum of the values of C in this pack; in
column (11), D; in column (13), X; in column (15), S. Take the
second pack, list the value of A in this pack on the second line of
the form; and list the corresponding sum values as for the first
pack. Repeat until all packs have been so treated. When this is
completed make the extensions for columns 6, 8, 10, 12, 14, & 16 as
follows:

Multiply the values listed respectively in columns 5, 7, 9,
11, 13 & 15 on any line each times the value listed on the same line
in column 1. List the products so obtained in columns 6, 8, 10, 12,
14, & 16 respectively. Do this for all lines.

Add columns 2, 6, 8, 10, 12, 14 & 16.
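The reason the pack method works is that multiplying each class value by the sum of another variable within the pack, and adding over packs, gives the same total as multiplying the two variables card by card. A minimal sketch, with six invented cards carrying only A and B (the data and names are hypothetical):

```python
from itertools import groupby

# Hypothetical coded cards: (A, B) values for six observations.
cards = [(0, 3), (2, 5), (2, 1), (5, 4), (5, 4), (5, 0)]

# Sort on A and split into packs of equal A, as the sorter would.
cards_sorted = sorted(cards, key=lambda card: card[0])
extension_total = 0
items_total = 0
for class_value, pack in groupby(cards_sorted, key=lambda card: card[0]):
    pack = list(pack)
    sum_b = sum(b for _, b in pack)      # column (7): sum of B in the pack
    extension = class_value * sum_b      # column (8): class value times sum
    items_total += len(pack)             # column (2): number of cards
    extension_total += extension

# The same figure obtained card by card, without grouping.
direct_total = sum(a * b for a, b in cards)
print(extension_total, direct_total, items_total)
```

The fewer distinct class values a series has, the fewer packs there are to tabulate and extend, which is why the coding is kept coarse.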

Take the cards and sort them again; this time on the second
variable, B. Take a second sheet (Form 2; Sh. 2); divide the sorted
cards into packs, according to the values of B and list these values
of successive packs in column 1. Tabulate, list and extend in a
manner exactly similar to that when the cards were sorted on A,
except no figures need appear in the two A columns, 5 & 6.
Sort on C and proceed as for A & B on a new sheet (Form 2; Sh. 3).
No figures need appear in the A or B columns: columns 5, 6, 7, & 8.

Sort on D and repeat (Form 2; Sh. 4): No figures in the A, B,
or C columns: columns 5 to 10 inclusive.

Sort on X and repeat (Form 2; Sh. 5). No figures in the A, B,
C, or D columns: columns 5 to 12 inclusive.

(Note: The reason that an increasing number of columns can
be omitted is that to make the extensions and sum them would give
figures already computed: Thus if we sort on C and extend its values
times D, adding the extensions, we arrive at the same figure as if we
had sorted on D and extended its values times C. In case difficulty
is encountered in making the figures check to the check sum,
explained later in connection with Form 3, it may be advisable to make
the extensions here directed to be omitted, for the sake of
comparisons and to help locate the errors.)

Following are the tabulations of the five sortings made in
performing the above steps for our example. Note that in each case a
check is afforded by adding up column 2. This should add to the total
number of cards in the problem as shown by the data on Form 1. A
further check may be afforded by adding the sum columns for each
variable: columns 5, 7, 9, 11, 13 & 15. These should on every sheet add
to the same corresponding figures given on Form 1.


                 Form 2, Sheets #1 to #5: Total Lines

[The detail lines of the five sheets are not legible in this copy. The
total line of each sheet is given below; totals of the sum columns are
shown in parentheses, each followed by the total of its extensions.]

Sorted :  No.  :       A       :      B      :      C       :      D      :      X      :       S
  on   : Items :  Sum    Ext.  :  Sum   Ext. :  Sum   Ext.  :  Sum   Ext. :  Sum   Ext. :  Sum     Ext.
       :  (2)  :  (5)    (6)   :  (7)   (8)  :  (9)   (10)  : (11)   (12) : (13)   (14) : (15)    (16)

  A    : (171) : (1302)  12216 : (705)  5395 : (1149)  9077 : (680)  5044 : (774)  6153 : (4610)  37885
  B    : (171) :               : (705)  3655 : (1149)  4874 : (680)  2719 : (774)  3100 : (4610)  19743
  C    : (171) :               :             : (1149) 10099 : (680)  4051 : (774)  5336 : (4610)  33437
  D    : (171) :               :             :              : (680)  4786 : (774)  3094 : (4610)  19694
  X    : (171) :               :             :              :             : (774)  5890 : (4610)  23573
The next step is to transcribe figures from the sheets (1 to
5) of Form 2, to another sheet of the form shown on the following
page (Form 3).

On line A-1 (Form 3) column A list the figure taken from Form
2; Sh. 1, Col. 6, last (total) line. The other figures on line A-1 are
taken from the same sheet (Form 2; Sh. 1) last line, columns 8, 10,
12, 14 & 16. The figures filled into Form 3 apply to our example, so
they may be traced through the various forms.

The data for line B-1 comes from Form 2, Sh. 2, last line,
columns 8, 10, 12, 14, & 16. (This was the sheet used when the cards
were sorted on B.)

The data for line C-1 comes from Form 2, Sh. 3, last line,
columns 10, 12, 14, & 16. (This was the sheet used when the cards
were sorted on C.)

The data for line D-1 comes from Form 2, Sh. 4, last line,
columns 12, 14, & 16. (This was the sheet used when the cards were
sorted on D.)

The data for line X-1 comes from Form 2, Sh. 5, last line,
columns 14 & 16. (This was the sheet used when the cards were sorted
on X.) We have now filled in all that has to go on the lines ending
in "1" (i.e.: A-1, B-1 &c. &c.) on Form 3.

To obtain the figures to go on the lines ending in "2" on Form
3, we make computations from the data on Form 1. The sum of the
variable A (1302 in our example) is put into a computing machine and
multiplied successively by the mean of A, of B, of C &c. &c.
                               Form 3

                 Transcription of Sums of Extensions

Line :     A     :     B     :     C     :     D     :     X     :     S
A-1  :  12216.0  :   5395.0  :   9077.0  :   5044.0  :   6153.0  :  37885.0
A-2  :   9913.4  :   5367.9  :   8748.5  :   5177.5  :   5893.2  :  35100.5
A-3  :   2302.6  :     27.1  :    328.5  :   -133.5  :    259.8  :   2784.5

B-1  :           :   3655.0  :   4874.0  :   2719.0  :   3100.0  :  19743.0
B-2  :           :   2906.6  :   4737.1  :   2803.5  :   3191.0  :  19006.1
B-3  :           :    748.4  :    136.9  :    -84.5  :    -91.0  :    736.9

C-1  :           :           :  10099.0  :   4051.0  :   5336.0  :  33437.0
C-2  :           :           :   7720.5  :   4569.1  :   5200.7  :  30975.9
C-3  :           :           :   2378.5  :   -518.1  :    135.3  :   2461.1

D-1  :           :           :           :   4786.0  :   3094.0  :  19694.0
D-2  :           :           :           :   2704.1  :   3077.9  :  18332.1
D-3  :           :           :           :   2081.9  :     16.1  :   1361.9

X-1  :           :           :           :           :   5890.0  :  23573.0
X-2  :           :           :           :           :   3503.4  :  20866.2
X-3  :           :           :           :           :   2386.6  :   2706.8
The products so obtained are listed respectively in Columns A, B, C &c.,
on Form 3, line A-2.

Next the sum of the second Variable (B in our example; or 705)
is put into the computing machine and multiplied successively by the
mean of B, of C, of D &c. &c. The products so obtained are listed
respectively in Columns B, C, D, &c. of Form 3, line B-2.

Next the sum of the third Variable (C in our example; or 1149)
is put into the computing machine and multiplied successively by the
mean of C, of D &c. &c. The products so obtained are listed
respectively in Columns C, D, &c. of Form 3, line C-2.

In a similar manner the computations are made for the other
lines ending in "2", Form 3.

Note: In practice it is most convenient to prepare Form 3 on
a sheet of paper which also carries Form 1 at the top. The figures
for making the extension for lines of designation ending in "2" are
then before the operator.

Every item on a line ending in "2" is now subtracted from the
figure directly above it on a line ending in "1." (Note: naturally
should the subtrahend be greater than the minuend the difference will
be a negative value.) These differences are listed on the lines of
designation ending in "3". These lines are then transcribed to Form 4.
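The arithmetic of the "2" and "3" lines can be traced for the first row of the example. The figures below are the sum and means from Form 1 and the A-1 line of Form 3; rounding each product to one decimal place before subtracting is an assumption about how the operator carried the figures, chosen because it reproduces the printed A-3 line. The check column S is left out of the sketch.

```python
# Figures from Form 1 and from line A-1 of Form 3 (the worked example).
sum_a = 1302
means = {"A": 7.6140, "B": 4.1228, "C": 6.7193, "D": 3.9766, "X": 4.5263}
line_a1 = {"A": 12216.0, "B": 5395.0, "C": 9077.0, "D": 5044.0, "X": 6153.0}

# Line A-2: the sum of A times each mean, carried to one decimal place.
line_a2 = {k: round(sum_a * m, 1) for k, m in means.items()}

# Line A-3: line A-1 minus line A-2 (negative when line A-2 is the larger).
line_a3 = {k: round(line_a1[k] - line_a2[k], 1) for k in means}
print(line_a3)
```

Each difference is the sum of products less the product of one sum and the other mean, that is, N times the mean product of the deviations of the two series from their averages.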

The differences which have just been secured are the product
moments and squared standard deviations (times N); and are the
necessary data for any solution of multiple and partial correlation
coefficients, or gross and net regression coefficients. The usual solution
may be found in Yule: "Introduction to Statistical Method." The
solution given in the following, however, is a "Least Square" method, first
conceived of and developed by Messrs. Tolley and Ezekiel of the Bureau
of Agricultural Economics. An article describing the theory of the
method is published by them in the Journal of the Am. Stat. Assoc.
for December, 1923.

Note: In case it is decided to make the extensions by hand
rather than to use punch cards and tabulating machines (frequently
the case when short series, such as time series, are being analyzed),
a multi-columnar form should be used. In the six left-hand columns
list the data; column headings would be A, B, C, D, X, & S. The
remaining columns should be headed: A², AB, AC, AD, AX, AS; B², BC,
BD, BX, BS; C², CD, CX, CS; D², DX, DS; X², XS. In the A² column
write the squares of the values in the A column. In the AB column
write the products of the A items times their corresponding B items,
&c. &c. When all columns have been extended, add them, listing the
totals below, in their respective columns.

Find the means (averages) of the A, B, C, D, X, & S columns.
Multiply the sum of the A column times each of the means of the A,
B, C, D, X, & S columns and write the products below the sums of the
A², AB, AC, AD, AX, & AS columns respectively. Multiply the sum of
the B column times the means of the B, C, D, X, & S columns and write
the products below the sums of the B², BC, BD, BX, & BS columns. In
a similar manner extend the sum of the C column times the means of
C, D, X, & S; and inscribe the products in the C², CD, CX, & CS
columns. Also the sums of the D & X columns. It is not necessary
to multiply the sum of the S column times anything.

Now subtract the last values listed in the A² column and in
columns to the right thereof from the figures just above them. If the
subtrahend should be greater than the minuend the difference would
naturally be a negative value. These differences are now to be
transferred to a new sheet of the arrangement shown in Form 4. The
differences in the columns commencing with an "A" in their designation
are transferred to the first line of Form 4, designated as line A-3.
The differences in the columns commencing with a "B" in their
designation (this of course includes the B² col.) are transferred to line
B-3 of Form 4. The remaining differences are transferred in a
similar manner. The so arranged differences constitute the Normal
Equations used in the solution for the value of the net regressions
of A, B, C, & D, on X.
At this point the use of the Check Sum (Col. S) as a check on
the work to this point may be shown: On Line A-1 (Form 3) the sum of
the items in columns A, B, C, D & X should equal the figure in Column
S (Line A-1), thus checking the extension and addition of all the
figures used in connection with deriving these values.


The sum of the following:            Should check to:

    Line     Column                    Line     Column

    A-1        B
    B-1        B
     "         C
     "         D
     "         X                       B-1        S

    A-1        C
    B-1        C
    C-1        C
     "         D
     "         X                       C-1        S

    A-1        D
    B-1        D
    C-1        D
    D-1        D
     "         X                       D-1        S

    A-1        X
    B-1        X
    C-1        X
    D-1        X
    X-1        X                       X-1        S

By substituting "2" for "1"; and also "3" for "1" in the above
schedule, a further check may be secured. It is essential that the
values should all check, before the work is carried to a further stage.
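The schedule amounts to reading each line of Form 3 as a full row of a symmetrical table: the entries missing to the left of the diagonal are found in the corresponding column of the lines above. A sketch of the check applied to the "3" lines of the example follows; the figures are those of Form 3, while the helper `entry` and the tolerance of half a unit in the last place are merely illustrative choices.

```python
# Upper triangle of the "3" lines of Form 3 (columns A, B, C, D, X), with
# the check-sum column S held separately. Figures are from the example.
labels = ["A", "B", "C", "D", "X"]
upper = {
    "A": [2302.6, 27.1, 328.5, -133.5, 259.8],
    "B": [748.4, 136.9, -84.5, -91.0],
    "C": [2378.5, -518.1, 135.3],
    "D": [2081.9, 16.1],
    "X": [2386.6],
}
check_column = {"A": 2784.5, "B": 736.9, "C": 2461.1, "D": 1361.9, "X": 2706.8}

def entry(row, col):
    """Look up a product moment, using symmetry for the omitted entries."""
    i, j = sorted((labels.index(row), labels.index(col)))
    return upper[labels[i]][j - i]

for row in labels:
    total = sum(entry(row, col) for col in labels)
    agrees = abs(total - check_column[row]) < 0.05
    print(row, round(total, 1), check_column[row], agrees)
```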
                               (Form 4)

                           NORMAL EQUATIONS

Line# :    A     :    B     :    C     :    D     :    X     :    S     :   PM
A-3   :  2302.6  :    27.1  :   328.5  :  -133.5  :   259.8  :  2784.5  :
B-3   :          :   748.4  :   136.9  :   -84.5  :   -91.0  :   736.9  :
C-3   :          :          :  2378.5  :  -518.1  :   135.3  :  2461.1  :
D-3   :          :          :          :  2081.9  :    16.1  :  1361.9  :
X-3   :          :          :          :          :  2386.6  :  2706.8  :

                               SOLUTION

  1   :  2302.6  :    27.1  :   328.5  :  -133.5  :   259.8  :  2784.5  :
  2   : -1.0000  :  -.0118  :  -.1427  :   .0580  :  -.1128  : -1.2093  :

  3   :          :   748.4  :   136.9  :   -84.5  :   -91.0  :   736.9  :
  4   :          :     -.3  :    -3.9  :     1.6  :    -3.1  :   -32.8  :
  5   :          :   748.1  :   133.0  :   -82.9  :   -94.1  :   704.1  :
  6   :          : -1.0000  :  -.1778  :   .1108  :   .1258  :  -.9412  :

  7   :          :          :  2378.5  :  -518.1  :   135.3  :  2461.1  :
  8   :          :          :   -46.9  :    19.0  :   -37.1  :  -397.2  :
  9   :          :          :   -23.7  :    14.8  :    16.7  :  -125.2  :
 10   :          :          :  2307.9  :  -484.3  :   114.9  :  1938.7  :
 11   :          :          : -1.0000  :   .2098  :  -.0498  :  -.8400  :

 12   :          :          :          :  2081.9  :    16.1  :  1361.9  :
 13   :          :          :          :    -7.7  :    15.1  :   161.4  :
 14   :          :          :          :    -9.2  :   -10.4  :    78.0  :
 15   :          :          :          :  -101.6  :    24.1  :   406.8  :
 16   :          :          :          :  1963.4  :    44.9  :  2008.1  :
 17   :          :          :          : -1.0000  :  -.0229  : -1.0228  :

 18 D :          :          :          :          :   .0229  x    16.1  :    .4
 19 C :          :          :   .0498  :   .0048  :   .0546  x   135.3  :   7.4
 20 B :          :  -.1258  :  -.0097  :   .0025  :  -.1330  x   -91.0  :  12.1
 21 A :   .1128  :   .0016  :  -.0078  :   .0013  :   .1079  x   259.8  :  28.0

 22   P.M.                                                              :  48.0
 23   Sq. Root                                                          :  6.94
 24   :  248.45  :   -3.60  :    17.9  :   -3.05  :  259.79  :
 25   Square root of 2386.6 (See Line X-3: Col. X)                      :  48.8

      R equals 6.94 ÷ 48.8 or .142
On Form 4, Lines A-3, B-3, C-3, and D-3, are the data
comprising the normal equations which are to be solved. The method of
solution is given on lines 1 to 25, as described below.*

*This is the "Doolittle Method". See Oscar S. Adams, "Geodesy:
Application of the Theory of Least Squares to the Adjustment of
Triangulation," 1913. Special publication #28, Geodetic Survey.

On Line 1 write the first normal equation, i.e., copy Line A-3.
Now, divide every item on Line 1 by the first item of Line 1, reverse
the algebraic signs and list the quotients on Line 2. In our example,
we divide by 2302.6. The algebraic sum of the items on Line 2, Columns
A, B, C, D, and X, should equal the quotient appearing in Column S on
the same line. This will not always check to the last digit, owing to
the dropping of places in the division. Now, draw a line under the
figures just written in.

On Line 3 copy in the second normal equation, that is, copy
Line B-3. Now put the figure on Line 2, Column B (i.e. -.0118) into
the multiplying machine and multiply it consecutively by the items on
Line 1 in Columns B, C, D, X, and S, listing the products in the
respective columns on Line 4. In our example, we multiply -.0118 by
27.1, by 328.5, by -133.5, by 259.8 and by 2784.5, giving as products
the items appearing on Line 4, i.e., giving -.3; -3.9; 1.6; -3.1; -32.8.
Now add the items on Line 4 to the items immediately above on Line 3,
giving Line 5. Careful attention must be given to the algebraic signs.
Now, divide the figures on Line 5 by the first figure on Line 5, and
reverse the algebraic signs, listing the quotients on Line 6. In our
example, we divide by 748.1. The algebraic sum of the first four items
on Line 5 should check to the last item of the line, 704.1, in the S
column. In like manner, the algebraic sum of the first four items on
Line 6 should check to the last item on Line 6, or -.9412.

Now copy down the third normal equation on Line 7; i.e., copy
Line C-3. Put the number in the C column on the second line, -.1427,
into the multiplying machine and multiply it consecutively by the
items in the C, D, X, and S columns of Line 1, listing the products in
the corresponding columns on Line 8, giving careful attention to the
algebraic signs. Next, put the item in the C column of Line 6, -.1778,
into the multiplying machine and multiply it consecutively by the C,
D, X, and S column figures on Line 5, listing the products in their
respective columns on Line 9, giving careful attention to algebraic
signs. Now, add together for each column the values in Lines 7, 8,
and 9, giving Line 10. The first items of this line should check to
the last, similar to the case for Lines 5 and 1. Divide each of the
items of this line by the first item in the line; that is, divide by
2307.9, reverse the signs and list the quotients on Line 11. In a
manner similar to Line 6, the first items on this line should check to
the last when added together. Draw another line.

On Line 12, write the fourth normal equation, that is, copy
Line D-3. Put into the multiplying machine the value on Line 2, Column
D, and multiply it consecutively by the D, X, and S column values of
Line 1, listing the products on Line 13 in their respective columns,
giving careful attention to the algebraic signs. Next, place the value
on Line 6, column D, into the multiplying machine, .1108, and multiply
consecutively by the values on Line 5, columns D, X, and S, listing the
products in their respective columns on Line 14, giving careful
attention to the algebraic signs. Next, put the figure on Line 11, column
D, .2098, into the multiplying machine and multiply consecutively by
the values on Line 10, columns D, X, and S, listing the products in
their respective columns on Line 15, giving careful attention to
algebraic signs. Next, add for columns D, X, and S the values on Lines
12, 13, 14, and 15, listing the algebraic sums on Line 16. Careful
attention should be given to algebraic signs. The algebraic sum of the
first items on Line 16 should check to the last item, as was the case
for Lines 10, 5, and 1. Divide the items on Line 16 by the first item
on Line 16, 1963.4, reverse the algebraic signs, and list the quotients
on Line 17. The first items of Line 17 should check to the last item
of Line 17, as was the case for Lines 11, 6, and 2.

We have now finished the "forward" solution for the 4 normal
equations, and by this time the reader presumably understands the
method so that the extension of the solution to a greater number of
variables will be a comparatively simple matter. We are now ready for
the back solution, which is given on Lines 18, et seq.
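The forward solution is, in modern terms, an elimination carried through the symmetrical normal equations without interchange of rows. The sketch below is not the tabular routine of Form 4 itself: it works on the full square array at full machine precision instead of the one-decimal figures of the form, and the variable names are illustrative. Its leading figures may be compared with Lines 1, 5, 10, and 16 of the example.

```python
# Normal equations from Form 4, lines A-3 to D-3 (upper triangle as printed).
# Columns: A, B, C, D, then X as the right-hand side.
upper = [
    [2302.6, 27.1, 328.5, -133.5, 259.8],
    [748.4, 136.9, -84.5, -91.0],
    [2378.5, -518.1, 135.3],
    [2081.9, 16.1],
]
n = 4

# Fill in the lower triangle by symmetry to get the full augmented array.
aug = [[0.0] * (n + 1) for _ in range(n)]
for i, row in enumerate(upper):
    for k, value in enumerate(row):
        j = i + k
        aug[i][j] = value
        if j < n:
            aug[j][i] = value

# Forward elimination: clear each column below its leading (diagonal) item.
for p in range(n):
    for r in range(p + 1, n):
        factor = aug[r][p] / aug[p][p]
        for c in range(p, n + 1):
            aug[r][c] -= factor * aug[p][c]

leading_items = [round(aug[i][i], 1) for i in range(n)]
net_regression_d = aug[n - 1][n] / aug[n - 1][n - 1]
print(leading_items, round(net_regression_d, 4))
```

The small differences from the printed lines arise only from the dropping of decimal places in the hand computation.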
On Line 18, column X, write the value on Line 17, column X; reverse
the sign. This value is the net regression coefficient of the variable
D on X. Next in Column S, Lines 18 to 21, inclusive, list in inverse
order the values in column X, Lines A-3, B-3, C-3, D-3. Next on Line 19,
Column C, write the value on Line 11, Column X, reversing the sign. On
Line 20, column B, write the value on Line 6, column X, reversing the
sign. On Line 21, column A, write the value on Line 2, column X, reversing
the sign. Put the value on Line 18, column X, into the multiplying
machine and multiply it times the value on the same line in column S, and
list the product in Column PM (Product Moment). Then multiply it times
the values in column D, Lines 11, 6 and 2, listing the products
respectively on Lines 19, 20 and 21 in column D. Multiply it also times the
value in column D on Line A-3, listing the product in Column D on Line 24.
Next, add together the values on Line 19 in columns C and D, listing their
sum in column X, Line 19. This last sum is the net regression coefficient
of the variable C on X.

Put the net regression coefficient of the variable C on X into the
multiplying machine, and, having a care for algebraic signs, multiply it
times the value listed beside it in column S, writing the product on the
same line in column PM. Then, multiply it by the values in column C on
Lines 6, 2 and A-3, writing the products on Lines 20, 21 and 24,
respectively, of the same column. Now, add the values on Line 20, columns
B, C and D together, writing the sum in column X on the same line, having a
particular regard to algebraic signs. This last sum written on Line 20 in
column X is the net regression coefficient of the variable B on X. Place
it in the multiplying machine, and, having a care to algebraic signs,
multiply it times the value listed beside it in column S, writing the
product on the same line in column PM. Then, multiply it by the values in
column B on Lines 2 and A-3, listing the products respectively on Lines 21
and 24 of the same column. Now, add the values on Line 21 in columns
A, B, C, and D, writing the sum in column X on the same line. This sum
is the net regression coefficient of A on X. Place it in the multiplying
machine and multiply it times the value listed beside it in column S,
having a care for algebraic signs, and list the product in column PM on the
same line. Also, multiply it times the value on Line A-3, column A,
listing the product in the same column on Line 24. Now, the values on Line
24 in columns A, B, C, D, added together algebraically, should equal the
value listed on Line A-3, column X, which serves as a check upon the
derivation of other net regression coefficients than D on X. There is a
difference of 9 between the two values in our example. A greater accuracy
may be secured by carrying the arithmetic to a greater number of places
throughout the entire solution. It was deemed expedient to make the
example as simple as possible.
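
The back solution can likewise be sketched in Python. The function name and
the figures below are assumptions made for illustration; the quotient rows
are hypothetical values for a three-variable problem, entered without the S
column, and the final lines reproduce the Line 24 check against the first
normal equation.

```python
def back_solution(quotient_rows):
    """Back solution (illustrative sketch).

    quotient_rows[k] holds the sign-reversed quotients for variable k,
    from its own column through the X column (the S column is omitted).
    Returns the net regression coefficients in the order of the variables.
    """
    n = len(quotient_rows)
    b = [0.0] * n
    # Work from the last variable back to the first.
    for k in range(n - 1, -1, -1):
        row = quotient_rows[k]
        total = -row[-1]  # X-column entry with its sign reversed
        for j in range(k + 1, n):
            # Coefficient already found, times the quotient in its column.
            total += b[j] * row[j - k]
        b[k] = total
    return b


# Hypothetical quotient rows (not the bulletin's figures).
quotient_rows = [
    [-1.0, -0.5, -0.25, -1.75],  # variable A: A, B, C, X
    [-1.0, -0.625, -1.375],      # variable B: B, C, X
    [-1.0, 1.0],                 # variable C: C, X
]
b = back_solution(quotient_rows)

# Check in the manner of Line 24: the coefficients times the first normal
# equation's terms should reproduce that equation's X term (7.0 here).
first_equation = [4.0, 2.0, 1.0]
check = sum(coef * term for coef, term in zip(b, first_equation))
print(b, check)
```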

We have now secured the net regression coefficients, which are
essential to the forming of the regression equation for predicting or
estimating the values of X. To ascertain to how great a degree these
predictions conform to the actual values, it is necessary to obtain some
measure of agreement between them, the predicted and the actual. This
measure of agreement is the coefficient of multiple correlation, R, defined
on page 1. To secure this coefficient of multiple correlation, sum the
values in the PM column, listing the sum on Line 22. Next, secure the
square root of this sum, given on Line 23. Finally, secure the square root
of the value listed on Line X-3, column X, listing this in the PM column
on Line 25. This value divided into the value immediately above it on
Line 23 gives the coefficient of multiple correlation.
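
The arithmetic for R may be sketched as follows. The function and argument
names are assumptions, and the figures are hypothetical; the products with
X and the sum of squares of X are taken to be the values that would stand in
column S (Lines 18 to 21) and on Line X-3, column X.

```python
import math


def multiple_correlation(coefficients, products_with_x, sum_squares_x):
    """Coefficient of multiple correlation from the PM column (sketch)."""
    # PM column: each net regression coefficient times its product with X.
    product_moments = [b * s for b, s in zip(coefficients, products_with_x)]
    line_22 = sum(product_moments)      # sum of the PM column
    line_23 = math.sqrt(line_22)        # square root of that sum
    line_25 = math.sqrt(sum_squares_x)  # square root of the X-3, X value
    return line_23 / line_25


# Hypothetical figures (not the bulletin's).
r = multiple_correlation([1.0, 2.0, -1.0], [7.0, 9.0, 1.0], 30.0)
print(round(r, 4))
```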

There are certain aids and other checks in the solution which can
be applied to help in locating errors. The diagonal terms of the normal
equations (2302.6, 748.4, 2378.5, &c.) are always positive in sign. In
making the solution the figures listed immediately below these figures (to
be added to them in the course of the solution) are always negative in
sign. The sums appearing above the -1.00000 terms are always positive
(i.e., 743.1, 2307.9, 1963.4).

Accuracy is increased if comparatively small diagonal terms are
avoided. (This can be controlled by controlling the original coding.)

The Product Moment, Line 22, is always positive in sign.
THE REGRESSION EQUATION.
The final step in the arithmetic is to write the "regression,"
or "predicting" or "estimating" equation, as it is variously called.
First write down the regression coefficient of A on X; in our example
this is .1079. Beside this value write what was done algebraically in
simplifying the A series: this will be of the form [illegible] (see page
6, note (1)). Then further write the subtraction of the average of
the A series as taken from Form 1, enclosing all in parentheses. The
algebraic form so far will look like this:

     .1079((A - 35)/[illegible] - 7.61[illegible])

In an exactly similar manner treat the B, C, D, & X series. (Disregard
the S series.) There will be no regression coefficient for the X series.
The algebraic sum of the A, B, C, & D expressions should be equated to
the X expression as follows:

     (X - 95)/[illegible] - [illegible]
          = .1079((A - 35)/[illegible] - 7.61[illegible])
          - .135[illegible]((B - [illegible])/[illegible] - [illegible])
          + .0546(([illegible] - 120)/[illegible] - 6.7193)
          + [illegible]



This is the "raw" regression equation. It is only necessary now to
evaluate for X, involving only elementary algebra, to put the equation
in its most useful form:

X = .1079 A - .0993 B + .0273 C + .0137 D + 99.0532

This is the "predicting" equation mentioned in the definition
of R in the note on page 1.
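
The elementary algebra of this last step can be sketched in Python. The
sketch assumes each series was simplified by subtracting a constant,
dividing by a constant, and then subtracting the average of the simplified
series, as in the A expression above. The function name and every figure
below are hypothetical and are not taken from the example.

```python
def decode(coefficients, codings, x_coding):
    """Turn the "raw" regression equation into the predicting equation.

    Each coding is (origin, divisor, average of the simplified series).
    Raw form assumed:
        (X - x0)/cx - mx = sum of b_i * ((V_i - k_i)/c_i - m_i)
    Returns the coefficients of the original variables and the constant.
    """
    x0, cx, mx = x_coding
    final = [cx * b / c for b, (_, c, _) in zip(coefficients, codings)]
    constant = x0 + cx * mx - sum(
        cx * b * (k / c + m) for b, (k, c, m) in zip(coefficients, codings)
    )
    return final, constant


# Hypothetical two-variable case (not the bulletin's figures).
coefficients = [0.5, -0.25]
codings = [(10.0, 2.0, 3.0), (20.0, 4.0, 1.5)]
x_coding = (100.0, 2.0, 5.0)
final, constant = decode(coefficients, codings, x_coding)

# Evaluate the predicting equation for one pair of original values.
estimate = final[0] * 14.0 + final[1] * 28.0 + constant
print(final, constant, estimate)
```

Evaluating the raw equation and the predicting equation for the same
original values should give the same estimate, which offers a simple check
on the algebra.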